Goto

Collaborating Authors

 Gaborone


Urban Safety Perception Through the Lens of Large Multimodal Models: A Persona-based Approach

arXiv.org Artificial Intelligence

Understanding how urban environments are perceived in terms of safety is crucial for urban planning and policymaking. Traditional methods like surveys are limited by high cost, required time, and scalability issues. To overcome these challenges, this study introduces Large Multimodal Models (LMMs), specifically Llava 1.6 7B, as a novel approach to assess safety perceptions of urban spaces using street-view images. In addition, the research investigated how this task is affected by different socio-demographic perspectives, simulated by the model through Persona-based prompts. Without additional fine-tuning, the model achieved an average F1-score of 59.21% in classifying urban scenarios as safe or unsafe, identifying three key drivers of perceived unsafety: isolation, physical decay, and urban infrastructural challenges. Moreover, incorporating Persona-based prompts revealed significant variations in safety perceptions across the socio-demographic groups of age, gender, and nationality. Elder and female Personas consistently perceive higher levels of unsafety than younger or male Personas. Similarly, nationality-specific differences were evident in the proportion of unsafe classifications ranging from 19.71% in Singapore to 40.15% in Botswana. Notably, the model's default configuration aligned most closely with a middle-aged, male Persona. These findings highlight the potential of LMMs as a scalable and cost-effective alternative to traditional methods for urban safety perceptions. While the sensitivity of these models to socio-demographic factors underscores the need for thoughtful deployment, their ability to provide nuanced perspectives makes them a promising tool for AI-driven urban planning.


Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input

arXiv.org Artificial Intelligence

Geo-entity linking is the task of linking a location mention to the real-world geographic location. In this paper we explore the challenging task of geo-entity linking for noisy, multilingual social media data. There are few open-source multilingual geo-entity linking tools available and existing ones are often rule-based, which break easily in social media settings, or LLM-based, which are too expensive for large-scale datasets. We present a method which represents real-world locations as averaged embeddings from labeled user-input location names and allows for selective prediction via an interpretable confidence score. We show that our approach improves geo-entity linking on a global and multilingual social media dataset, and discuss progress and problems with evaluating at different geographic granularities.


Do Large Language Models Latently Perform Multi-Hop Reasoning?

arXiv.org Artificial Intelligence

We study whether Large Language Models (LLMs) latently perform multi-hop reasoning with complex prompts such as "The mother of the singer of 'Superstition' is". We look for evidence of a latent reasoning pathway where an LLM (1) latently identifies "the singer of 'Superstition'" as Stevie Wonder, the bridge entity, and (2) uses its knowledge of Stevie Wonder's mother to complete the prompt. We analyze these two hops individually and consider their co-occurrence as indicative of latent multi-hop reasoning. For the first hop, we test if changing the prompt to indirectly mention the bridge entity instead of any other entity increases the LLM's internal recall of the bridge entity. For the second hop, we test if increasing this recall causes the LLM to better utilize what it knows about the bridge entity. We find strong evidence of latent multi-hop reasoning for the prompts of certain relation types, with the reasoning pathway used in more than 80% of the prompts. However, the utilization is highly contextual, varying across different types of prompts. Also, on average, the evidence for the second hop and the full multi-hop traversal is rather moderate and only substantial for the first hop. Moreover, we find a clear scaling trend with increasing model size for the first hop of reasoning but not for the second hop. Our experimental findings suggest potential challenges and opportunities for future development and applications of LLMs.


Flickr Africa: Examining Geo-Diversity in Large-Scale, Human-Centric Visual Data

arXiv.org Artificial Intelligence

Biases in large-scale image datasets are known to influence the performance of computer vision models as a function of geographic context. To investigate the limitations of standard Internet data collection methods in low- and middle-income countries, we analyze human-centric image geo-diversity on a massive scale using geotagged Flickr images associated with each nation in Africa. We report the quantity and content of available data with comparisons to population-matched nations in Europe as well as the distribution of data according to fine-grained intra-national wealth estimates. Temporal analyses are performed at two-year intervals to expose emerging data trends. Furthermore, we present findings for an ``othering'' phenomenon as evidenced by a substantial number of images from Africa being taken by non-local photographers. The results of our study suggest that further work is required to capture image data representative of African people and their environments and, ultimately, to improve the applicability of computer vision models in a global context.


How to Create Dummy Data in Python

#artificialintelligence

Dummy data is randomly generated data that can be substituted for live data. Whether you are a Developer, Software Engineer, or Data Scientist, sometimes you need dummy data to test what you have built, it can be a web app, mobile app, or machine learning model. If you are using python language, you can use a faker python package to create dummy data of any type, for example, dates, transactions, names, texts, time, and others. Faker is a simple python package that generates fake data with different data types. Faker package is heavily inspired by PHP Faker, Perl Faker, and by Ruby Faker.


Andile Ngcaba's inq Wants to be Africa's Number one AI Service Provider.

#artificialintelligence

ICT industry veteran Andile Ngcaba's inq., a Pan-African digital service provider, wants to be Africa's number one artificial intelligence (AI) service provider. The company has points of contacts in 12 African cities, Johannesburg, Gaborone, Lusaka, Ndola, Blantyre, Lilongwe, Mzuzu, Lagos, Abuja, Port Harcourt, Kanu and Abidjan. It has concluded the 100% acquisition of Vodacom Business Africa's operations in Nigeria, Zambia and Cote d'Ivoire with a further planned acquisition in Cameroon pending regulatory approvals. At the time of the announcement of the transaction last June, inq. said this deals represents a significant milestone to its vision to be a leading provider of cloud and digitally based services in key markets across sub-Saharan Africa and provides additional vital assets in its build-out of a regional footprint. Today, inq. said this landmark transaction grows inq.'s regional footprint to 13 cities in 7 countries across Africa including its existing operations in Botswana, Malawi and Mozambique.


How Big Data and machine learning serves consumer wanderlust

#artificialintelligence

It is no surprise that data analytics and machine learning are fast becoming key components of every innovative company's toolkit, given the massive increase in the amount of data that companies are generating. Because of the sheer volume and complexity of data being created, it is often beyond human capacity to find relevant trends or insights within what has been tagged as'Big Data'. Notably, one of the big differences between machine learning and computer-assisted analysis (where humans are involved) is that the recent breakthroughs in machine learning enable computers to teach themselves how to solve problems. So previously, when humans were directing computers, they were limited to very direct questions and answers (for example, "what is my top selling item?") and required the person using the machine to dictate which method to use to the solve the problem. Now, machine learning enables computers to find answers in ways that are unguided by human intervention.